MEMORANDUM 

RM-2972-ARPA 

FEBRUARY  1962 


A  SAMPLING  INSPECTION  PROBLEM 
IN  ARMS  CONTROL  AGREEMENTS: 
A  GAME-THEORETIC  ANALYSIS 

Melvin  Dresher 


PREPARED  FOR: 

ADVANCED  RESEARCH  PROJECTS  AGENCY 


MOD 


SANTA  MONICA  •  CALIFORNIA 


MEMORANDUM 

RM-2972-ARPA 

FEBRUARY  1962 


ARPA  ORDER  NO.  189-61 


A  SAMPLING  INSPECTION  PROBLEM 
IN  ARMS  CONTROL  AGREEMENTS: 
A  GAME-THEORETIC  ANALYSIS 

Melvin  Dresher 


This  research  is  supported  and  monitored  by  the  Advanced  Research  Projects  Agency 
under  Contract  No,  SD-79.  Any  views  or  conclusions  contained  in  this  Memorandum 
should  not  be  interpreted  as  representing  the  official  opinion  or  policy  of  ARPA.  Permis¬ 
sion  to  quote  or  reproduce  must  be  obtained  from  rand. 


74e 


md 


1700  MAIN  ST.  •  SANTA  MONICA  •  CALIFORNIA 


RM-2972 

iii 


PREFACE 


An  important  operational  problem  in  arms-control  organization  is 
the  design  of  a  sampling  inspection  system  that  gives  maximum  insurance 
that  Ein  agreement  is  not  violated.  Unlike  sampling  in  manufacturing  or 
government,  sampling  for  arms  control  must  take  into  account  that  the 
statistical  population  may  be  altered  to  conceal  a  violation.  In  many 
respects  sampling  for  arms  control  resembles  a  game  of  strategy 
between  the  inspector  and  the  inspected. 

This  RAND  Research  Memorandum  analyzes  as  a  gcime  of  strategy 
a  problem  in  sampling  for  arms  control  where  the  inspector  is  allowed  a 
fixed  number  of  inspections  to  use  on  a  sequence  of  items  or  events,  each 
of  which  might  contain  or  conceal  a  violation.  Results  are  presented  on 
optimal  inspection  procedures.  The  results  may  be  of  interest  to  technical 
personnel,  particularly  statisticians  or  sampling  survey  designers,  who 
have  advisory  roles  in  arms-control  planning. 

This  research  is  a  part  of  RAND's  program  of  arms-control  studies 
for  the  Advanced  Research  Projects  Agency. 
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SUMMARY 

Many  disarmament  or  arms— control  agreements  may  be  monitored 
by  sample  inspections.  Unlike  the  usual  sampling  procedures,  sampling 
for  arms— control  agreements  must  take  into  account  the  possibility  that 
the  statistical  universe  from  which  samples  are  to  be  drawn  may  be 
tampered  with  so  as  to  decrease  the  probability  of  detection  of  a  violation. 

In  this  report  the  author  formulates  a  game— theoretic  model  for 
studying  a  sampling  problem  in  which  the  inspector  is  allowed  to  examine 
a  fixed  number  (usually  small)  of  items  or  natural  events  (e.g. ,  items 
from  an  assembly  line  under  an  agreement  limiting  military  production, 
or  seismic  events  under  a  nuclear  test-ban  agreement).  It  is  assumed 
that  the  inspections  are  to  be  performed  within  a  fixed  time  period  or  on 
a  series  of  events  of  fixed  length. 

Optimal  sampling  procedures  are  derived  as  functions  of  the  number 
of  inspections  and  the  size  of  the  statistical  universe.  Some  variations 
on  the  model  are  briefly  considered. 
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A  SAMPLING  INSPECTION  PROBLEM  IN 
ARMS  CONTROL  AGREEMENTS  : 

A  GAME-THEORETIC  ANALYSIS 

1.  INTRODUCTION 

Statistical  investigations  in  science,  industry,  or  census  taking 
generally  involve  sampling  a  universe  that  has  not  been  modified  to 
mislead  the  investigators.  In  sampling  for  disarmament  (or  arms^ 
control)  verification,  we  must  assume  that  the  statistical  universe  can 
be  tampered  with  to  conceal  a  violation.  The  usual  sampling  theory, 
therefore,  is  not  sufficiently  applicable  to  sampling  problems  for 
disarmament  verification.  In  many  situations,  sampling  to  monitor 
disarmament  agreements  can  be  formulated  as  a  zero— sum  two— person 
game. 

Another  problem  in  which  a  statistical  universe  may  be  tampered 
with  occurs  in  certain  weather— control  and  medical  experiments. 
Blackwell  and  Hodges  have  analyzed  this  problem  in  Annals  of  Mathe¬ 
matical  Statistics,  Vol.  28,  1957,  pp.  449-460. 

This  study  analyzes  as  a  game  the  sampling  problem  of  selecting 
a  fixed  number  of  items  for  close  inspection  out  of  a  series  of  items. 
This  problem  would  arise,  for  example,  in  the  following  arms— control 
or  disarmament  agreements: 

(1)  A  ban  on  underground  nuclear  tests  verified  by  on— site 
inspections  of  a  limited  number  of  those  seismic  shocks  that  cannot 


be  identified  by  seismology  to  be  earthquakes. 
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(2)  A  demilitarized  zone  such  that  its  continued  demilitarization 
would,  in  part,  be  verified  by  a  fixed  number  of  inspections  of  incoming 
vehicles  (railroad  cars,  trucks,  planes). 

(3)  An  agreement  to  prohibit  weapons  of  mass  destruction  in  space, 
to  be  verified  by  inspecting  a  fraction  of  all  space  launchings. 

(4)  A  prohibition  of  the  production  of  certain  military  equipment 

to  be  verified  by  a  limited  number  of  inspections  per  year  of  the  production 
of  related  civiliain  equipment. 

In  each  of  the  above  cases,  the  inspector  must  find  a  compromise 
between  two  conflicting  arguments.  The  inspections  must  not  be  early, 
on  account  of  possible  later  violations,  nor  must  they  be  late,  on  account 
of  possible  early  violations.  The  potential  violator  also  must  compromise 
between  two  opposing  arguments.  Stated  in  this  form,  the  design  of  an 
inspection  system  is  a  problem  in  the  timing  of  decisions  in  a  competitive 
environment,  or  a  game  of  timing. 

In  this  paper  we  formulate  and  analyze  this  game  of  timing  in  a 
simplified  form:  the  violator  chooses  a  time  for  a  single  violation,  with 
the  option  of  refraining  from  any  violation;  and  the  inspector  chooses 
times  for  on— site  inspections. 

In  order  to  formulate  the  problem  as  a  zero-sum  game,  in  the 
terminology  of  game  theory,  we  must  assign  relative  weights  for  the 
advantage  to  the  violator  of  an  undetected  violation,  as  compared  with 
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the  advantage  to  the  inspector  of  detecting  a  violation.  We  give  a  complete 
solution  in  case  these  advantages  are  equal,  and  a  partial  solution  if  they 
.  are  unequal.  We  also  formulate,  but  do  not  solve,  a  game  in  which  the 
evader  may  try  more  than  one  violation. 

Objections  may  be  made  that  the  problem  is  oversimplified,  but 
it  will  be  seen  that  the  problem,  as  stated,  does  introduce  many  essential 
elements.  For  a  qualitative  discussion  of  the  results  see  Sec.  9. 

2.  FORMULATION  OF  PROBLEM 

Suppose  that  n  consecutive  events  are  expected  during  a  time 
period.  Each  of  these  n  events  could  contain  or  conceal  a  violation  of 
an  agreement.  Only  by  inspection  can  it  be  determined  whether  the  event 
contains  a  violation.  It  is  impossible,  however,  to  inspect  every  event. 
The  number  of  inspections  allowed  may  be  much  smaller  than  the  number 
of  events  that  might  conceal  a  violation. 

Suppose  that  m  inspections  are  available,  where  m  <  n.  How 
should  the  Inspector  select  these  m  inspections  from  the  n  events? 

Suppose  that  at  most  one  of  these  n  events  will  contain  or  conceal 
a  violation.  How  should  the  Violator  place  this  violation,  if  any,  among 
the  n  events? 

On  our  assumptions,  the  Violator  wishes  to  pick  the  best  time 
(i.  e. ,  the  best  one  of  n  events)  for  his  violation,  and  the  Inspector  wishes 
to  pick  the  best  m  of  the  n  events  for  his  inspections. 
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Now  the  object  of  the  inspections  is  to  detect  a  violation,  if  any 
occurs.  Further,  we  assume  that  the  Violator's  object  is  to  evade  these 
inspections.  Hence  we  may  define  the  payoff  to  the  Inspector  as  follows: 

+1,  if  the  Inspector  detects  a  violation, 

(1)  —1^  if  the  Inspector  fails  to  detect  a  violation  that  occurs, 

O^if  there  is  no  violation. 

The  payoff  to  the  Violator  will  be  the  negative  of  (1). 

The  assumption  that  the  cost  of  an  inspection  that  does  not  detect 
a  violation  is  the  same  as  the  gain  if  a  violation  occurs  is,  of  course, 
arbitrary.  Other  values  could  be  assigned  to  these  outcomes,  as  will  be 
seen  in  Sec.  8. 

We  also  need  to  specify  the  information  available  to  the  two  sides. 
First,  we  shall  assume  that  both  sides  know  m  and  n  at  the  start  of 
the  game.  Further,  at  all  times  the  sides  know  the  state  of  the  game  — 
i.  e. ,  they  know  the  number  of  events  and  inspections  remaining  for  the 
rest  of  the  period. 

3.  VALUE  AND  OPTIMAL  STRATEGIES  OF  GAME 

We  shall  analyze  the  following  game  with  n  moves,  each  move 
being  a  simultaneous  choice  by  both  sides.  At  the  start  of  the  game  the 
Inspector  has  available  m  on— site  inspections  to  detect  a  violation  during 
the  n  moves.  A  move  by  the  Inspector  is  a  choice  of  whether  or  not  to 


inspect  a  specific  event.  A  move  by  the  Violator  is  a  choice  of  whether 
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to  introduce  a  violation  during  that  time  defined  by  the  event.  These 
choices  are  made  simultaneously  by  the  two  sides.  After  the  choices  are 
made  by  the  sides,  and  the  move  is  completed,  the  Violator  knows  the 
Inspector's  choice  for  that  move.  It  is  assumed  that  if  a  violation  occurs 
and  the  Inspector  inspects,  he  detects  it.  This  assumption  could  be 
modified. 

It  is  evident  that  the  value  of  the  game  and  optimal  strategies  will 
depend  on  the  number  of  inspections  and  the  number  of  moves  available. 
Let  us  then  define  the  function  t  v(m,  n)  by 

v(m,n)  =  the  value  of  the  game  in  which  the  Inspector  has 

m  inspections,  and  the  Violator  has  one  violation 
during  n  events . 

Then,  using  (1),  v(m,  n)  is  the  value  of  the  2x2  matrix  game 

Inspect 

(2)  Inspector 

Not  Inspect 

with  initial  conditions 

(3)  v(0,n)  = -1, 


Violator 

Violate  Not  Violate 


n  >1, 

m  > n  >  0. 


v(m,  n)  =  0, 
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From  the  description  of  the  payoff  in  this  game,  it  follows  that 


(4)  1  >  v(m,  n— 1)  >  v(m— 1,  n— 1)  >— 1. 

Therefore,  the  above  2x2  matrix  does  not  have  a  saddle— point  except 
when  m  =  n.  Solving  the  game  defined  by  (2),  we  obtain  v(m,  n)  as  a 
solution  of  the  functional  equation 


(5) 


v(m,  n) 


v(m,  n— 1)  +  v(m— 1,  n— 1) 
v(m,  n— 1)  —  v(m— 1,  n— 1)  +  2 


Figure  1  shows  how  v(m,  n)  changes  with  m  for  n  =  4,  5,  6,  10, 
20,  50,  100.  /' 

Solving  the  game  defined  by  (2), we  also  obtain  the  following  optimal 
strategy  vectors: 


(6) 


_ v(ni,n) _ 

v(m,  n— 1)  +  v(m— 1,  n— 1) 


for  Inspector, 
if  n  >  m  ; 


( 


for  Inspector,  if  n  =  m,- 


_ v(in,  n) _ 

v(m,  n— 1)  +  v(m— 1,  n— 1) 


/vim,  n— 1)  -  v(m— 1,  n-lr 
2 


for  Violator. 


Now  in  the  special  case  in  which  n  =  m,  the  payoff  matrix  (2) 


becomes 


Inspector's  poyoff  - 
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Fig.  I  —  Expected  payoff  to  Inspector 
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The  Violator  thus  has  a  unique  optimal  strategy:  not  violate.  However, 
the  Inspector  has -an  infinite  number  of  optimal  strategies;  in  fact, any 
strategy  of  the  form 


where  0  <  «  <— ,  is  optimal.  But  since  m  =  n,  the  Inspector  has  an 
inspection  for  each  event.  It  seems  good  sense  for  the  Inspector  to  use 
the  inspection  at  each  event.  Thus  if  n  =  m,  the  "preferred"  optimal 
strategy  for  the  Inspector  is 


Figure  2  shows  how  the  probability  of  inspection  (and  hence  the  proba¬ 
bility  of  detecting  a  violation  if  it  occurs)  changes  with  n  for  m  =  3,  6,  11. 


4.  SOLUTION  FOR  m  =  1 

Before  solving  (5)  in  general,  let  us  first  solve  this  equation  for 
m  =  1.  We  are  seeking  a  solution  of 


v(i  n)  =  v(l?n-l)  +v(0,n-l) _ 

v(l,  n-1)  -  v(0,  n-1)  +  2 


v(l,  n-1)  - 1 
v(l,  n— 1)  +  3 
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O 


Optimal  strategy  of  Inspector 
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Let 


w  = 


n  v(l,  n)  +  1’ 


or 


v(l,n)  - - 1. 

w 

n 


In  terms  of  the  new  variable,  (7)  may  be  written  as 


i - 1  = 

w 

n 


w 


n-1 


w 


+  2 


n-1 


Simplifying,  we  get  the  recursion  equation 


w  =  —  +  w 
n  2  n— 1 


This  yields  as  a  solution 


n  2  2 


.  + 


1  n  +  1 


or 


(8) 


v(l,  n)  = - 1  = - — 

w  n  +  1 

n 


In  a  similar  msuiner,  using  (8),  we  get 


n  +  n  +  2 
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and 

X  _  (n-1)  (n-2)  (n-3) 

v(3,n)  = - r -  . 

n  +  5n  +  6 

5,  GENERAL  SOLUTION 

We  shall  now  prove  that  the  general  solution  of  (5),  for  all  m  and 
n,  is  given  by 

(9)  v(m,n)  =  - 

Our  method  of  proof  consists  of  verifying  that  v(m,n),  as  defined 
by  (9),  satisfies  the  functional  equation  given  by  (5).  The  recursion 
relation  (5)  shows  that  the  solution  is  unique. 

First,  let  us  evaluate  v(m, n+1)  in  terms  of  v(m,n).  From  (9) 
we  obtain 


(10) 


v(m,  n+1) 


Now  by  Pascal's  rule  we  have 
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Hence  we  obtain 


m  p+l\  m  /  n  \  m  /n 

E  V  i/  =  2  li-lj  +  2 

i=0  i=0  i=0 


■‘O 


+  E  ' 
i=0 


Xi)  -O 


Substituting  the  last  expression  in  (10)  and  using  (9),  we  get 


v(m,  n+1)  =  — 


^  m  -  0 


v{m,n) 


But  since 


this  reduces  to 


v(m,  n+1) 


_ 2 _  ^_n_ 

v(m,  n)  n  —  m 


Simplifying  the  last  expression,  we  get 


(11)  v(m,  n+1) 


_ nv(m,  n) _ 

2(n— m)  +  >nv(m,  n) 
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Now  let  us  obtain  an  expression  for  v(m+l,  n)  in  terms  of  v(m,n). 


Using  (9),  we  have 


v(m+l,  n) 


But 


fn—l\  / ( n— 1^  _  n  —  m  —  1 

\m+y/  m  +  1 


n 

m  +  l 


whence 


v(m+l,  n) 


i;  n  —  m  —  1 
m  +  1 

1 _ n 

v(m,  n)  m  +  l 
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Simplifying, we  get 


(12) 


v(m+l,  n) 


(n— m— 1)  v(m,n) 
(m+1)  —  nv(m,  n) 


6.  VERIFICATION  OF  SOLUTION 

We  are  now  prepared  to  show  that  (9)  satisfies  (5).  We  may  write 
(11)  as  follows: 


(13) 


,  ^  (n-l)v(m,  n-1) 

"  2(n-l-m)-Kn-lMm,n-l)  ' 


We  may  write  (12)  as  follows: 


(14) 


v(m,  n— 1) 


(n— m— 1)  v(m— 1,  n— 1) 
m  —  (n— 1)  v(m— 1,  n— 1) 


—  y  v(m— 1,  n— 1) 


n-1 

m 


v(m— 1,  n— 1) 


From  the  last  expression  we  have 


v(m,  n— 1) 


n-1 

m 


v(m,  n— 1)  v(m— 1,  n— 1)  = 


n— 1 
m 


v(m— 1,  n— 1)  —  v(m— 1,  n— 1) . 
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Solving  this  for 


n-1 


m 


we  obtain 


n— 1  _  v(m— 1,  n— 1)  +  v(m,  n— 1) 

m  v(m— 1,  n— 1)  [v(m,n— 1)  +  l] 


Now  from  (13)  we  have 


v(m,  n) 


v(m,  n— 1) 
m  ’ 


-  2  + 


n-1 

m 


v(m,  n— 1) 


(15) 


v(m,  n— 1) _ 

2  —  2  +  v(m,  n— 1) 

n— i 


_ v(m,  n— 1) _ 

2  v(m— 1|  n— 1)  t  v(m,  n— 1)  +  1] 
v(m— 1,  n— 1)  +  v(m,  n— 1) 


+  v(m,  n— 1) 


For  convenience  in  writing,  let  us  set 


a  =  v(m— 1,  n— 1), 
b  =  v(m,  n— 1)  . 


Then  (15)  becomes 


v(m,  n) 


b 

2  -  2a(b+l)  +  b 


a  +  b 
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_ b(a  +  b) _ 

(2  +b)  (a  +  b)  -  2a(b+l) 


b(a  +  b) 
b(b  —  a  +  2) 


Hence  we  have  - 


,  .  v(m— 1,  n— 1)  +  v(m,  n— 1) 

'  v(m,  nil)  -  v(m-l,  n-1)  V  2  ’ 

which  agrees  with  (5).  We  have  thus  proved  that  (9)  is  a  solution  of  (5). 

Using  (9),  we  can  also  express  the  optimal  strategies,  (6),  as 
functions  of  m  and  n,  as  given  in  Sec.  3. 

7.  SOME  EXAMPLES  OF  OPTIMAL  STRATEGIES 

Table  1  gives  the  optimal  strategy  of  the  Inspector  for  five  values 
of  n  (9,  10,  20,  50,  100)  and  five  values  of  m  (1-5).  For  example, 
the  entry  at  m  =  2,  n  =  20,  means  that  whenever  there  are  20  periods 
to  go  and  the  Inspector  has  two  inspections  left,  he  should  inspect  at  the 
next  period  with  probability  0.  095. 
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Table  1  —  Optimal  Strategy  of  Inspector 
(Probability  of  inspecting  the  next  event  if  the  Inspector 
has  m  inspections  left  and  there  are  n  periods  to  go) 


n 

m 

9 

10 

20 

50 

100 

1 

0.100 

0.091 

0.048 

0.019 

0.010 

2 

0.195 

0.179 

0.095 

0.039 

0.020 

3 

0.  285 

0.261 

0.141 

0.059 

0.030 

4 

0.  363 

0.  337 

0.187 

0.078 

0.039 

5 

0.427 

0.401 

0.232 

0.098 

0.049 

Table  2  gives  the  optimal  strategy  of  the  Violator  for  the  same 
values  of  m  and  n  as  in  Table  1.  For  example,  the  entry  at  m  =  3, 
n  =  10,  means  that  whenever  there  are  ten  periods  to  go  and  the  Inspector 
has  three  inspections  left,  the  Violator  should  schedule  a  violation  at 
the  next  period  with  probability  0.082,  provided  no  violation  has  yet 


occurred. 


RM-29  72 
18 


Table  2  —  Optimal  Strategy  of  Violator 
(Probability  of  scheduling  a  violation  during  the  n— th  event 
if  the  Inspector  has  m  inspections  left  and  if  no 
violation  has  yet  occurred) 


n 

m 

9 

10 

20 

50 

100 

1 

0.100 

0.091 

0.048 

0.020 

0.010 

2 

0.095 

0.087 

0.047 

0.020 

0.010 

3 

0.087 

0.082 

0.047 

0.020 

0.010 

4 

0.075 

0.073 

0.  046 

0.019 

0.009 

5 

0.056 

0.060 

0.045 

0.019 

0.009 

8,  GENERALIZATION  OF  THE  MODEL 

As  already  stated,  we  have  studied  an  extremely  simplified  model 
of  the  game  of  sampling  in  a  competitive  environment,  a  model  chosen 
only  to  illustrate  the  principles  involved.  No  great  difficulty  arises  in 
carrying  out  analogous  arguments  for  more  general  games  of  sampling. 

In  the  simple  model  the  payoff  function  implies  that  the  payoff  to 
the  Inspector  if  he  detects  a  violation  is  the  same  as  the  payoff  to  the 
Violator  if  a  violation  is  not  detected.  We  can  remove  this  implication  by 
making  the  payoffs  different,  in  which  case  v(m,  n)  is  obtained  as  the 
value  of  the  following  matrix  game  : 
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c 

where  c  >  0  is  the  payoff  to  the  Inspector  if  he  detects  a  violation.  The 
value  of  the  game  satisfies  the  functional  equation 

Wm  =  cv(m,  n-1)  +  v(m-l,  n-1) 

’  v(m,  n— 1)  —  v(m— 1,  n— 1)  +  1  +  c 


v(m— 1,  n— 1) ' 
v(m,  n— 1) 


In  the  special  case  m  =  1  (one  inspection  available)  we  can  evaluate  v 
explicitly, obtaining 


(16) 


v(l,  n) 


n-1 
n  +  c 


The  optimal  strategies  in  this  case  are 


(17) 


for  the  Inspector,  if  n>  2, 


1 

n  +  c 


,  for  the  Violator,  if  n>2. 


If  n  =  1,  however,  the  optimal  strategies  are 
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for  the  Inspector, 


for  the  Violator. 


If  c  =  0  (i.e.,  the  Violator  suffers  no  penalty  if  a  violation  is 
detected)  we  can  evaluate  v(m,n)  explicitly.  In  this  case,  we  have 


v(m,  n)  *  - 


n— m 
n 


The  optimal  strategies  are 


n 

n 


for  the  Inspector^ 


for  the  Violator. 


The  simple  model  further  assumes,  in  effect,  that  the  Violator  is 
capable  of  at  most  one  violation  during  the  n  events.  Suppose  we  generalize 
this  by  assuming  that  the  Violator  is  capable  of  n  violation  attempts, 
one  at  each  of  the  n  events.  Thus  if  a  violation  is  not  detected,  the 
Violator  can  have  additional  violations  at  later  periods.  The  game 
terminates,  however,  when  a  violation  is  detected.  Now  v(m,  n)  is  the 
value  of  the  following  2x2  matrix  game: 
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-1  +  v(m,  n— 1) 


v(in— 1,  n— 1)' 
v{in,  n— 1) 


or 


,  ,  v(m,  n— 1)  +  v(in— 1,  n— 1)  —  v(m,  n— 1)  v(m— 1,  n— 1) 

"> - 2  -  v(m-l,  n-1) - 


Further,  we  have  assumed  a  fixed  number  of  inspections  to  be  used 
in  a  fixed  time  period.  It  turns  out  that  if  the  Inspector  uses  an  optimal 
strategy  he  may,  with  probability  larger  than  zero,  use  all  m  inspections 
earlier  than  n  periods.  The  Violator  can  then  be  sure  of  success  if  he 
acts  after  the  m-th  inspection,  and  before  the  n  periods  have  expired. 

One  way  of  rectifying  this  situation  is  by  declaring  arbitrarily  that  when¬ 
ever  m  =  0  and  n  >  0,  there  is  a  probability  that  an  additional  inspection 
will  be  allowed.  This  will  yield  a  different  game  and  different  optimal 
strategies . 

9.  DISCUSSION  OF  RESULTS 

It  is  of  interest  to  compare  the  results  of  using  an  optimal  strategy  with 
other  possible  strategies.  We  can  readily  make  this  comparison  for  the  case 
in  which  just  one  inspection  and  one  violation  are  possible.  Let  us  do  this 
for  the  more  general  model  in  which  the  Violator  gains  one  unit  by  an  un¬ 
detected  violation  and  the  Inspector  gains  c  units  if  he  detects  a  violation. 
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We  recall  that  in  this  case  the  value  of  the  gaime  (the  expected  loss  by 
the  Inspector)  is  (n— l)/(n+c),  where  n  is  the  number  of  events. 

Let  us  compare  the  above  with  the  outcome  if  the  Inspector  uses  the 
strategy  of  allotting  equal  probabilities  to  all  events,  and  the  Violator 
uses  the  strategy  of  scheduling  a  violation  one  period  after  the  inspection. 
Then  the  expected  loss  by  the  Inspector  for  this  strategy  is  (n— l)/n,  as 
compared  with  (n— l)/(n+c)  for  the  optimal  strategy. 

If  c  is  small  compared  with  n,  then  the  two  results  (expectedl 
losses)  are  both  the  same.  If  c  is  comparable  in  size  to  n,  however, 
then  the  optimal  strategy  gives  a  substantial  improvement.  Thus  if  there 
is  a  great  loss  to  the  Violator  in  being  detected,  then  the  Inspector  gains 
much  by  following  an  optimal  strategy. 

From  (17)  we  observe  that,  for  optimal  inspection,  the  larger  the 
value  of  c  the  more  likely  the  Inspector  will  defer  his  inspection  to  the 
later  periods.  Further,  the  probability  is  (l+c)/(n+c)  that  the  Inspector 
defers  his  inspection  to  the  last  period. 


